daemonset: differentiate between cases in nodeShouldRun #38787
Conversation
cc @davidopp
@kubernetes/daemonset @kubernetes/sig-apps please review.
Much cleaner. I'd ask for a slightly better test for the new method, but if it's covered sufficiently in your mind I'm OK with it as is.
// * shouldContinueRunning:
//     Returns true when a daemonset should continue running on a node if a daemonset pod is already
//     running on that node.
func (dsc *DaemonSetsController) nodeShouldRunDaemonPod(node *v1.Node, ds *extensions.DaemonSet) (wantToRun, shouldSchedule, shouldContinueRunning bool, err error) {
Can I ask you to do a table test for this method that shows the outcomes?
Will do.
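For reference, a table-driven test could look roughly like this (a minimal sketch, not the actual test added in this PR; newTestController, newNode, newOutOfDiskNode, and newDaemonSet are hypothetical fixture helpers):

func TestNodeShouldRunDaemonPod(t *testing.T) {
    cases := []struct {
        node                                             *v1.Node
        ds                                               *extensions.DaemonSet
        wantToRun, shouldSchedule, shouldContinueRunning bool
    }{
        // One entry per outcome, e.g. a node-name mismatch blocks everything,
        // while an out-of-disk condition only blocks new scheduling:
        // {newNode("other-node"), newDaemonSet("node-1"), false, false, false},
        // {newOutOfDiskNode("node-1"), newDaemonSet("node-1"), true, false, true},
    }
    for i, c := range cases {
        dsc := newTestController() // hypothetical helper wiring fake clients
        wantToRun, shouldSchedule, shouldContinueRunning, err := dsc.nodeShouldRunDaemonPod(c.node, c.ds)
        if err != nil {
            t.Errorf("case %d: unexpected error: %v", i, err)
        }
        if wantToRun != c.wantToRun || shouldSchedule != c.shouldSchedule || shouldContinueRunning != c.shouldContinueRunning {
            t.Errorf("case %d: got (%t, %t, %t), want (%t, %t, %t)", i,
                wantToRun, shouldSchedule, shouldContinueRunning,
                c.wantToRun, c.shouldSchedule, c.shouldContinueRunning)
        }
    }
}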
  // If the daemon set specifies a node name, check that it matches with node.Name.
  if !(ds.Spec.Template.Spec.NodeName == "" || ds.Spec.Template.Spec.NodeName == node.Name) {
-     return false
+     wantToRun, shouldSchedule, shouldContinueRunning = false, false, false
+     return
Can you inline the returned values like you do below?
return false, false, false, nil
  }
  // TODO: Move it to the predicates
  for _, c := range node.Status.Conditions {
      if c.Type == v1.NodeOutOfDisk && c.Status == v1.ConditionTrue {
-         return false
+         // the kubelet will evict this pod if it needs to. Let kubelet
+         // decide wether to continue running this pod so leave shouldContinueRunning
s/wether/whether/
        predicates.ErrPodAffinityNotMatch,
        predicates.ErrServiceAffinityViolated,
        predicates.ErrTaintsTolerationsNotMatch:
        return false, false, false, fmt.Errorf("unexpected reason: GeneralPrdicates should not return reason %s", reason.GetReason())
typo
        predicates.ErrMaxVolumeCountExceeded,
        predicates.ErrNodeUnderMemoryPressure,
        predicates.ErrNodeUnderDiskPressure:
        shouldSchedule = false
Can you make it explicit that the other two booleans are true here?
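Something like this, for instance (a sketch of just this case; the surrounding switch over predicate failure reasons is elided):

case predicates.ErrMaxVolumeCountExceeded,
    predicates.ErrNodeUnderMemoryPressure,
    predicates.ErrNodeUnderDiskPressure:
    // the daemon pod is still wanted here and an already-running pod may
    // keep running; only scheduling of a new daemon pod is blocked.
    wantToRun, shouldSchedule, shouldContinueRunning = true, false, true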
  for i := range dsList.Items {
      ds := &dsList.Items[i]
-     shouldEnqueue := (dsc.nodeShouldRunDaemonPod(oldNode, ds) != dsc.nodeShouldRunDaemonPod(curNode, ds))
-     if shouldEnqueue {
+     _, oc1, oc2, err := dsc.nodeShouldRunDaemonPod(oldNode, ds)
It's not clear what the names mean. Maybe oldShouldSchedule, oldShouldContinueRunning? Same for the names below.
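i.e. something along these lines (a sketch of the node-update loop body; error handling abbreviated, and assuming the controller's existing enqueueDaemonSet helper):

_, oldShouldSchedule, oldShouldContinueRunning, err := dsc.nodeShouldRunDaemonPod(oldNode, ds)
if err != nil {
    continue
}
_, curShouldSchedule, curShouldContinueRunning, err := dsc.nodeShouldRunDaemonPod(curNode, ds)
if err != nil {
    continue
}
// enqueue only when the node change flips one of the decisions
if oldShouldSchedule != curShouldSchedule || oldShouldContinueRunning != curShouldContinueRunning {
    dsc.enqueueDaemonSet(ds)
}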
Force-pushed from b514bd1 to 9cc8c7c.
@Kargakis @smarterclayton addressed all comments.
        // this one is probably intentional since it's a workaround for not having
        // pod hard anti affinity.
        predicates.ErrPodNotFitsHostPorts:
        wantToRun, shouldSchedule, shouldContinueRunning = false, false, false
Is node affinity something that doesn't hold true for DaemonSets and works only for pods that are processed by the scheduler?
@Kargakis node affinity bubbles up here as a predicates.ErrNodeSelectorNotMatch. The DaemonSet obeys "required during scheduling" node affinities.
Yes, actually never mind my question. Required/ignored during execution is yet to be implemented.
Basically "required during execution" only is not implemented yet. Here you have implemented "required during execution" for daemon sets. My real question from the begining was "Is node affinity going to affect daemon sets? Will I be able to specify predicates in the pod template of a daemon set?"
Yes, the plan is to support node affinity in daemonsets. We will need to modify this method when "required during execution" is implemented.
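For illustration, the kind of hard node affinity that would surface here as predicates.ErrNodeSelectorNotMatch (a sketch using the v1 affinity types; the disktype key and ssd value are made-up examples):

affinity := &v1.Affinity{
    NodeAffinity: &v1.NodeAffinity{
        // "required during scheduling" is what the DaemonSet controller
        // honors; a "required during execution" variant is not implemented yet.
        RequiredDuringSchedulingIgnoredDuringExecution: &v1.NodeSelector{
            NodeSelectorTerms: []v1.NodeSelectorTerm{{
                MatchExpressions: []v1.NodeSelectorRequirement{{
                    Key:      "disktype",
                    Operator: v1.NodeSelectorOpIn,
                    Values:   []string{"ssd"},
                }},
            }},
        },
    },
}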
/lgtm
Automatic merge from submit-queue (batch tested with PRs 39483, 39088, 38787)
Specifically we need to differentiate between wanting to run, should run and should continue running. This is required to support all taint effects and will improve reporting and end user debuggability.
fixes #28839 among other things